1.  Introduction.

Our  primary  concern  is  the  comparison  of  two  treatments  in  a 
clinical  setting,  although  our  results  are  quite  general  and  this  problem 
is  only  one  application.  To  fix  the  discussion,  in  our  major  example  we 
assume  that  patients  are  recruited  sequentially  and  assigned  at  random 
into  one  of  the  two  groups.  Important  covariable  information  is  recorded 
at  the  time  of  recruitment.  The  response  variable  is  numerical  (e.g., 
systolic  blood  pressure,  cholesterol  level)  and  is  observed  either 
instantaneously  or  at  a  fixed  time  (from  recruitment)  in  the  future.  We 
assume  that  there  is  either  a  maximum  sample  size  or  a  target  sample  size 
$N$ (see Section 3).

In  most  experiments  of  this  type,  the  investigator  will  want  to 
study  the  data  periodically  or  as  it  is  collected  and  (for  ethical 
reasons  if  no  other)  stop  if  obvious  differences  are  observed.  In  this 
instance  a  sequential  analysis  is  called  for,  possibly  based  on  repeated 
significance  testing  (RST)  (see  for  example  Siegmund  (1977,  1979)).  The 
usual  methods  for  such  tests  would  be  based  on  least  squares  estimates 
and  F-tests  for  a  linear  model.  Such  an  analysis  has  two  potentially 
serious  difficulties: 

(i)  if there are outliers or if the error distribution is heavier-tailed
     than the normal distribution, the estimates will be inefficient and
     the tests will have low power;

(ii) particular design points (which may be "outliers in the design") can
     be extremely influential, even though the observed response is not
     an outlier; this could cause difficulties in interpretation at the
     very least.


To overcome the first problem, Huber (1977) and others have
suggested the "classical" M-estimates, which give bounded influence to
outliers in the response but unfortunately give unbounded influence
to outliers in the design. Recently, there have been a number of
proposals to handle problems (i) and (ii) jointly (Maronna and Yohai (1978));
these are the "bounded influence regression" (BIR) estimates.

Our  model  takes  the  general  form 

(1.1)  $y_i = c_i \beta_0 + \gamma_0 z_i \quad (i = 1, 2, \ldots),$

where the $\{z_i\}$ are i.i.d. with distribution function symmetric about
zero. The parameter $\gamma_0$ is a scale factor to be explicitly defined
later. For our purposes it is convenient and not unrealistic to assume
that the design vectors $\{c_i\}$ are also i.i.d. and independent of the
errors; modifications of this assumption are discussed in Section 5.

In  the  two-sample  problem,  we  will  write  B  =  (y  a  6*),  where  y  is  the 
intercept  and  a  is  the  treatment  effect.  An  individual  design  point  will 
take  the  form  (1  ±1  c„) ,  where  the  sign  is  determined  by  the  treatment 
group . 

All the estimates mentioned so far can be generated in the following
manner. Let $(S_n\ (p \times p),\ \gamma_n)$ be estimates of scale which converge to
$(S_0, \gamma_0)$; particular choices of $\gamma_n$ are "Proposal 2" (Huber (1977)) or
"MAD" (Andrews et al. (1972)). Define $\beta_n$ as the solution to

(1.2)  $\sum_{i=1}^{n} c_i'\, g_1(c_i S_n)\, \psi\big(g_2(c_i S_n)(y_i - c_i \beta_n)/\gamma_n\big) = 0 .$
Particular choices of $g_1$, $g_2$, and $\psi$ give important special cases, e.g.,

    Least Squares    $g_1(x) = g_2(x) = 1$,    $\psi(x) = x$
    Classical M      $g_1(x) = g_2(x) = 1$,    $\psi(x)$ bounded
    BIR              $|x g_1(x)|$ bounded,     $\psi(x)$ bounded.


Special cases of BIR estimates which are particularly noteworthy include:

    Mallows (unpublished) and
    Maronna and Yohai (1978)            $g_2(x) = 1$,  $S_n = I$

    Schweppe (unpublished)              $g_1(x) g_2(x) = 1$,  $S_n = I$

    Hampel and Krasker (unpublished)    $g_1(x) g_2(x) = 1$,  $g_1(x) = 1/x$.

In Section 2 we state the major weak convergence results, apply them
to the two-sample problem in Section 3, and prove them in Section 4.
Extensions are discussed in Section 5.
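The estimating equation (1.2) can be made concrete with a small numerical sketch. The code below is only an illustration under stated assumptions, not the authors' algorithm: it takes $S_n = I$ and a known scale `gamma`, uses Huber's $\psi$ with an assumed tuning constant, uses a hypothetical Schweppe-type weight pair `schweppe_weights` with $g_1 g_2 = 1$, and solves the equation by iteratively reweighted least squares (a technique chosen here for convenience; the paper does not prescribe a solver). All function names and constants are assumptions.

```python
import math

def huber_psi(x, b=1.345):
    """Huber's psi: identity on [-b, b], clipped outside (b is an assumed constant)."""
    return max(-b, min(x, b))

def schweppe_weights(c, b_design=2.0):
    """Hypothetical Schweppe-type pair (g1, g2) with g1 * g2 = 1 and |c| * g1 bounded."""
    norm = math.sqrt(sum(v * v for v in c))
    g1 = 1.0 if norm <= b_design else b_design / norm
    return g1, 1.0 / g1

def solve_linear(a, y):
    """Solve a small dense linear system by Gaussian elimination with pivoting."""
    n = len(y)
    m = [row[:] + [y[k]] for k, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= f * m[col][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = m[r][n] - sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = s / m[r][r]
    return x

def bir_estimate(design, y, gamma=1.0, weights=schweppe_weights, b=1.345, iters=50):
    """Solve sum_i c_i' g1(c_i) psi(g2(c_i)(y_i - c_i beta)/gamma) = 0 by IRLS."""
    p = len(design[0])
    beta = [0.0] * p
    for _ in range(iters):
        xtwx = [[0.0] * p for _ in range(p)]
        xtwy = [0.0] * p
        for c, yi in zip(design, y):
            g1, g2 = weights(c)
            resid = yi - sum(cj * bj for cj, bj in zip(c, beta))
            u = g2 * resid / gamma
            # IRLS weight g1 * psi(u) / resid, using its limit g1*g2/gamma near resid = 0
            w = g1 * g2 / gamma if abs(u) < 1e-12 else g1 * huber_psi(u, b) / resid
            for j in range(p):
                xtwy[j] += w * c[j] * yi
                for k in range(p):
                    xtwx[j][k] += w * c[j] * c[k]
        beta = solve_linear(xtwx, xtwy)
    return beta
```

With `weights=lambda c: (1.0, 1.0)` and a very large `b` the iteration reduces to ordinary least squares; with the same weights and a finite `b` it gives a classical M-estimate. In a two-sample design each row would look like (1, +1 or -1, covariates).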


2.  Major  convergence  results. 

Except where noted, we discuss only the special case that $S_n = S_0$
(and we take $S_0 = I$ with no loss). Our two results depend on whether or
not $\psi$ is monotone. If so, our technique is based on Yohai and Maronna
(1979), while if not it is based on Carroll (1978). Since there are two
results, we have chosen to list groups of possible assumptions.

In the applications we are going to assume that if $N$ is the maximum
sample size, then one takes at least $K_N$ observations before stopping,
where $K_N \to \infty$, $K_N/N \to 0$. We make use of this convention in this
section. The proofs and assumptions can be modified to the case
$K_N \equiv$ constant, but at a cost of notational complication.
Define the process $\{G_N(\cdot)\}$ by

(2.1)  $G_N(s) = 0$  for  $0 \le s < K_N/N$,
       $G_N(s) = [Ns](\beta_{[Ns]} - \beta_0) N^{-1/2} \gamma_0^{-1}$  for  $K_N/N \le s \le 1$.

Theorem 1.  Assume A1-A7 and B1-B5 below. Then

(2.2)  $\sup_{0 \le s \le 1} [Ns] N^{-1} \Big| \Sigma_1 G_N(s) - N^{-1/2} \sum_{i=1}^{[Ns]} c_i' g_1(c_i) \psi(g_2(c_i) z_i) \Big| \xrightarrow{P} 0 .$

Theorem 2.  Assume A1-A7, B4 and C1-C5. Then

(2.3)  $\sup_{0 \le s \le 1} \Big| \Sigma_1 G_N(s) - N^{-1/2} \sum_{i=1}^{[Ns]} c_i' g_1(c_i) \psi(g_2(c_i) z_i) \Big| \xrightarrow{P} 0 .$


Clearly, Theorem 1 gives a weak convergence result for the process
$[Ns] G_N(s)/N$ in $D^p[0,1]$, while Theorem 2 considers the more interesting
process $G_N(s)$. In the next section these results and their applications
are discussed.

Let $(c, z)$ be distributed as $(c_1, z_1)$.

A1.  $\psi$ is bounded, odd, absolutely continuous, and Lipschitz of order
one.

A2.  $|c|\, g_1(c) \le M_0$, $g_1(c) > 0$ almost surely.

A3.  $E|c|\, g_2(c) \le M_0$, $g_2(c) \ge 1$ almost surely.
A4.  There exists $A_0 > 0$ such that for every $|t| \le A_0$,

$\sup_{|q|, |h| \le A_0} h^{-2} E\big[\psi((1+q+h) g_2(c)(z - ct)) - \psi((1+q) g_2(c)(z - ct))\big]^2 < \infty .$

A5.  $\Sigma_1 = E\, c' c\, g_1(c) g_2(c) \psi'(g_2(c) z)$ is of full rank, and as $|t| \to 0$,
$r \to 0$,

$\big| E\, c' g_1(c) \psi(g_2(c)(1+r)(z - ct)) + \Sigma_1 t \big| = o(|t| + |r|) .$

A6.  $\Sigma_2 = E\, c' c\, \{g_1(c) \psi(g_2(c) z)\}^2$ is of full rank.

A7.  $\lim_{C \to 0} \sup_{|q|, |t| \le C} E\big[\psi((1+q) g_2(c)(z - ct)) - \psi(g_2(c) z)\big]^2 = 0 .$

(This actually follows from A1-A4, but it is convenient to state it
separately.)

B1.  $\psi$ is monotone nondecreasing and nonconstant, and if $D(u,z) =
(\psi(u+z) - \psi(u))/z$, then $|u| \le a$, $|z| \le b$ imply $D(u,z) \ge d > 0$.

B2.  $P\{|z| \le \varepsilon\} > 0$ for all $\varepsilon > 0$.

B3.  The minimal eigenvalue $\lambda_{\min}$ of the matrix

$E\, g_1(c) g_2(c)\, c' c\, I\{g_2(c) \le a,\ g_2(c)|c| \le b\}$

is positive.

B4.  $\sup\{[Ns]\, |\gamma_{[Ns]} - \gamma_0|\, N^{-1/2} : 0 \le s \le 1\} = O_p(1)$.
B5.  For every $\varepsilon > 0$, there exist $Q$, $N_0$ such that $N \ge N_0$ implies

$P\{Q^{-1} \le \gamma_n \le Q \text{ for all } K_N \le n \le N\} \ge 1 - \varepsilon .$

C1.  In addition to A1, $\psi$ has two bounded continuous derivatives except
on a finite set $B$.

C2.  $\psi$ is constant outside an interval.

C3.  The distribution function of $g_2(c) z$ is continuous at all $b \in B$.

C4.  For every $\varepsilon > 0$, as $N \to \infty$,

$P\{|\beta_n - \beta_0| > \varepsilon \text{ for some } K_N \le n \le N\} \to 0 .$

C5.  For every $\varepsilon > 0$, as $N \to \infty$,

$P\{|\gamma_n - \gamma_0| > \varepsilon \text{ for some } K_N \le n \le N\} \to 0 .$

Remark 2.1.  A1 holds for the common M-estimators. B1 is as in Yohai and
Maronna (1979), while C1-C3 are needed by Carroll (1978).

Remark 2.2.  A2-A3 are not unusual for BIR estimates. A4, A5, and A7 are
smoothness conditions corresponding to those of Bickel (1975).

Remark 2.3.  When $\psi$ is monotone, B2-B5 are specially adapted versions of
assumptions in Yohai and Maronna (1979). In an unpublished manuscript, they
have also shown C4-C5 when $g_1(c) = 1$, $\psi$ is monotone, and scale is
estimated by Proposal 2.
3.  Two-sample problem.

For the two-sample problem, the treatment effect is $\alpha = a'\beta$,
where $a' = (0\ 1\ 0 \ldots 0)$. We consider the situation in which we are
allowed to take at most $N$ observations and let the process start by
taking at least $K_N$ observations. The usual RST methodology for testing
$H_0$: $\alpha = 0$ vs. $H_1$: $\alpha \ne 0$ follows this form: for each $K_N \le n < N$, one
computes the usual F- or t-statistic based on $n$ observations and compares
it to a cutoff point (possibly depending on $n$, $N$). $H_1$ is chosen and
experimentation stopped if for any $K_N \le n < N$ the test statistic is
"large"; at the final observation ($n = N$), a new cutoff point is defined.

Note that in the previous section we have shown that if $W$ is
Brownian motion,

$[Ns] N^{-1}\, a' G_N(s)\, (a' \Sigma_1^{-1} \Sigma_2 \Sigma_1^{-1} a)^{-1/2} \Rightarrow s W(s) ,$

where "$\Rightarrow$" denotes weak convergence in $D[0,1]$. Let $\{C_N(\cdot)\}$ be a
sequence of nonnegative functions in $D[0,1]$ converging (uniformly) to a
continuous function $C(\cdot)$ and let $D$ be a positive number. If $\gamma_0$, $\Sigma_1$,
and $\Sigma_2$ were all known, then a general class of RST's which fits into
the framework outlined above would take the following form:

Take an initial sample of $K_N$ observations. Reject $H_0$
in favor of $H_1$ if either of the following obtains:

(i)  $[Ns] N^{-1/2} |a' \beta_{[Ns]}| > C_N(s) A_0$ for some $K_N/N \le s \le 1$;

(ii) $N^{1/2} |a' \beta_N| > D A_0$

(where $A_0^2 = \gamma_0^2\, a' \Sigma_1^{-1} \Sigma_2 \Sigma_1^{-1} a$).
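The stopping rule in (i) and (ii) can be sketched in a few lines. This is a hedged illustration only: it assumes the sequence of contrast estimates $a'\beta_n$ has already been computed and that $A_0$ is known, and the function name `repeated_significance_test` and its arguments are hypothetical.

```python
import math

def repeated_significance_test(contrast_estimates, a0, c_n, d, k_n):
    """Apply rules (i) and (ii) to a'beta_n for n = 1..N.

    contrast_estimates[n - 1] holds a'beta_n; c_n is the boundary function
    C_N(s) on [0, 1]; d is the final cutoff D; k_n is the initial sample size.
    Returns (reject_h0, n_at_decision).
    """
    n_max = len(contrast_estimates)
    root_n = math.sqrt(n_max)
    for n in range(k_n, n_max + 1):
        s = n / n_max
        # rule (i): [Ns] N^{-1/2} |a'beta_[Ns]| > C_N(s) A_0
        stat = n / root_n * abs(contrast_estimates[n - 1])
        if stat > c_n(s) * a0:
            return True, n
    # rule (ii): N^{1/2} |a'beta_N| > D A_0
    final = root_n * abs(contrast_estimates[-1])
    return final > d * a0, n_max
```

For example, with $N = 100$, a constant boundary $C_N(s) = 3$, $D = 2$ and $A_0 = 1$, a constant estimate of 1 first exceeds the boundary at $n = 31$, while a constant estimate of 0.25 never crosses the boundary in (i) but is rejected by (ii) at $n = N$.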

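The constant $A_0$ can be estimated. One reasonable estimate is

$A_N^2(s) = \gamma_{[Ns]}^2\, a' \Sigma_{1N}^{-1}(s)\, \Sigma_{2N}(s)\, \Sigma_{1N}^{-1}(s)\, a ,$

where

(3.1)  $\Sigma_{1N}(s) = [Ns]^{-1} \sum_{i=1}^{[Ns]} c_i' c_i\, g_1(c_i) g_2(c_i)\, \psi'\big(g_2(c_i)(y_i - c_i \beta_{[Ns]})/\gamma_{[Ns]}\big)$

and

(3.2)  $\Sigma_{2N}(s) = [Ns]^{-1} \sum_{i=1}^{[Ns]} c_i' c_i\, \big(g_1(c_i)\, \psi\big(g_2(c_i)(y_i - c_i \beta_{[Ns]})/\gamma_{[Ns]}\big)\big)^2 .$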
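A plug-in estimate of this kind can be sketched as follows. The code is a minimal illustration under assumptions, not the authors' implementation: it takes Huber's $\psi$ (with derivative equal to 1 inside $[-b, b]$ and 0 outside), forms sample analogues of $\Sigma_1$ and $\Sigma_2$ from the first $n$ observations, and returns $\gamma\,(a' \Sigma_{1}^{-1} \Sigma_{2} \Sigma_{1}^{-1} a)^{1/2}$. The names `estimate_a_n` and `solve_linear`, and the `weights` callback returning the pair $(g_1, g_2)$, are hypothetical.

```python
import math

def huber_psi(x, b):
    return max(-b, min(x, b))

def huber_psi_prime(x, b):
    return 1.0 if abs(x) <= b else 0.0

def solve_linear(a, y):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(y)
    m = [row[:] + [y[k]] for k, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= f * m[col][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][k] * x[k] for k in range(r + 1, n))) / m[r][r]
    return x

def estimate_a_n(design, y, beta, gamma, a, weights, b=1.345):
    """Return gamma * sqrt(a' S1^{-1} S2 S1^{-1} a) from the given observations."""
    n, p = len(y), len(beta)
    s1 = [[0.0] * p for _ in range(p)]
    s2 = [[0.0] * p for _ in range(p)]
    for c, yi in zip(design, y):
        g1, g2 = weights(c)
        u = g2 * (yi - sum(cj * bj for cj, bj in zip(c, beta))) / gamma
        w1 = g1 * g2 * huber_psi_prime(u, b)      # summand weight for Sigma_1
        w2 = (g1 * huber_psi(u, b)) ** 2          # summand weight for Sigma_2
        for j in range(p):
            for k in range(p):
                s1[j][k] += w1 * c[j] * c[k] / n
                s2[j][k] += w2 * c[j] * c[k] / n
    v = solve_linear(s1, a)                       # v = S1^{-1} a (S1 is symmetric)
    quad = sum(v[j] * s2[j][k] * v[k] for j in range(p) for k in range(p))
    return gamma * math.sqrt(quad)
```

As a sanity check, with $g_1 = g_2 = 1$ and $b$ very large (the least squares case) and an intercept-only design, the value reduces to the root mean square of the residuals.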

The procedure we propose is the same as above but with $A_N(s)$
replacing $A_0$. Before analyzing the power of this test, we need the
following:

Lemma 3.1.  Under the assumptions of Theorem 1, C4 holds if we also assume
that for all $L > 0$, $\sigma > 0$ and $|\theta| = 1$,

(3.3)  $E\, g_1(c)(c\theta)\, \psi\big(\sigma g_2(c)(z - L c\theta)\big) < 0 .$

(We will analyze the power of these tests for contiguous alternatives so
the term $a'\beta_0$ below will depend upon $N$, though this will not be explicit
in the notation.)

Theorem 3.  Suppose that $N^{1/2} a' \beta_0 \to \eta$ (finite), that C4 and C5 hold, and
that either (2.2) or (2.3) holds. Suppose further that $\psi$ satisfies A1-A7
and

(3.4)  $\lim_{\varepsilon \to 0} E\, |c|\, g_1(c) \sup_{|r| \le \varepsilon,\, |t| \le \varepsilon} \big| \psi'(g_2(c)(1+r)(z - ct)) - \psi'(g_2(c) z) \big| = 0 .$

Then the power of the RST converges to

(3.5)  $P\{|W(s) + s\eta^*| > C(s) \text{ for some } 0 \le s \le 1 \text{ or } |W(1) + \eta^*| > D\} ,$

where $\eta^* = \eta\, (\gamma_0^2\, a' \Sigma_1^{-1} \Sigma_2 \Sigma_1^{-1} a)^{-1/2}$.
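The limiting power (3.5) is a boundary-crossing probability for Brownian motion with drift, so it can be approximated by simulation when no closed form is at hand. The sketch below is an assumption-laden illustration: it discretizes $W$ on a grid of `n_steps` points (which slightly underestimates crossing probabilities, since crossings between grid points are missed), and the function name and defaults are hypothetical.

```python
import math
import random

def rst_limiting_power(eta_star, boundary, d, n_steps=200, n_paths=2000, seed=0):
    """Monte Carlo estimate of
    P{|W(s) + s*eta_star| > C(s) for some 0 <= s <= 1, or |W(1) + eta_star| > D}.

    boundary is the limit function C(s); d is the final cutoff D.
    """
    rng = random.Random(seed)
    dt = 1.0 / n_steps
    sd = math.sqrt(dt)
    hits = 0
    for _ in range(n_paths):
        w = 0.0
        crossed = False
        for k in range(1, n_steps + 1):
            w += rng.gauss(0.0, sd)          # Brownian increment over one grid step
            s = k * dt
            if abs(w + s * eta_star) > boundary(s):
                crossed = True
                break
        if not crossed:
            # the loop ran to s = 1, so w is W(1) here
            crossed = abs(w + eta_star) > d
        hits += crossed
    return hits / n_paths
```

Calling the function with `eta_star = 0` estimates the asymptotic size of the test for a given pair $(C(\cdot), D)$; increasing `eta_star` traces out the power curve.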

Remark 3.1.  The quantity (3.5) can be computed from Anderson (1960). In
comparing different tests, the relevant quantity is

$\gamma_0^2\, a' \Sigma_1^{-1} \Sigma_2 \Sigma_1^{-1} a .$

Remark 3.2.  Siegmund (1979) has considered RST's with (essentially)

$C_N(s) = \big(2 [Ns]^2 N^{-1} (\exp(a/[Ns]) - 1)\big)^{1/2}$

and has derived approximations in the normal case which are better than
(3.5).
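The boundary in Remark 3.2 is easy to evaluate directly. The following sketch simply codes the formula as printed there, reading $[Ns]$ as the integer part of $Ns$; the function name `siegmund_boundary` is hypothetical.

```python
import math

def siegmund_boundary(s, n_max, a):
    """Evaluate C_N(s) = (2 [Ns]^2 N^{-1} (exp(a/[Ns]) - 1))^{1/2}."""
    m = math.floor(n_max * s)                # [Ns], the integer part of N*s
    if m < 1:
        raise ValueError("need [Ns] >= 1")
    # expm1 keeps precision when a / [Ns] is small
    return math.sqrt(2.0 * m * m / n_max * math.expm1(a / m))
```

When $[Ns]$ is large relative to $a$, $\exp(a/[Ns]) - 1 \approx a/[Ns]$, so $C_N(s)^2 \approx 2a[Ns]/N$, which is close to $2as$.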


Remark 3.3.  The approximations (2.2) and (2.3) can be used to construct
analogues to classical F-tests (see Schrader and Hettmansperger (1980)
for another approach). One could test $H_0$: $K'\beta = 0$ by rejecting for
large values of

$(K' \beta_{[Ns]})'\, \big[ K' \big( \Sigma_{1N}(s)^{-1} \Sigma_{2N}(s) \Sigma_{1N}(s)^{-1} \big) K \big]^{-1} K' \beta_{[Ns]} .$
4.  Proofs.

The proofs of Theorems 1-3 are broken into a number of steps. In
Lemmas 4.1-4.4 we make the assumptions of Theorem 1. Also define
$\sigma_0 = \gamma_0^{-1}$ and $\sigma_n = \gamma_n^{-1}$.

Lemma 4.1.  Define $\xi_i(r) = c_i' g_1(c_i) \psi(g_2(c_i) z_i (1+r))$. Then the process

$H_N(r,s) = N^{-1/2} \sum_{i=1}^{[Ns]} \xi_i(r)$

on $D\{[-A_0, A_0] \times [0,1]\}$ converges weakly to a Gaussian process which is
almost surely continuous.

Proof.  The finite dimensional distributions converge by A1 and A2. By
A1, A2, A4 and the remark following Theorem 3 of Bickel and Wichura
(1971, p. 1665), $H_N$ is tight (in the notation of their equation (3),
$\gamma_1 = \gamma_2 = 2$, $\beta_1 = \beta_2 = 1$). Define the continuous process

$H_N^*(r,s) = H_N(r,s) + (Ns - [Ns])\, \xi_{[Ns]+1}(r)\, N^{-1/2}$
$\phantom{H_N^*(r,s)} = H_N(r,s) + o_p(1)$

uniformly in $(r,s)$, from which the lemma follows.  □

Lemma 4.2.  Define

$V_N^*(r,s,t) = [Ns] N^{-3/2} \sum_{i=1}^{[Ns]} c_i\, g_1(c_i)\, \psi\big(g_2(c_i)(z_i - c_i t N^{1/2}/[Ns])(1 + r N^{1/2}/[Ns])\big) ,$

$V_N(r,s,t) = V_N^*(r,s,t) - V_N^*(0,s,0) - E V_N^*(r,s,t) .$

Then for all $\varepsilon > 0$, $M > 0$ there exists $B = B(\varepsilon, M)$ such that

$P\{|V_N(r,s,t)| > \varepsilon \text{ for some } |r| \le M,\ |t| \le M,\ BN^{-1/2} \le s \le 1\} < \varepsilon .$

Proof.  Fix $\varepsilon > 0$, $M > 0$. For any $B > 0$, $\delta > 0$, A2-A3 and the fact that $\psi$ is
Lipschitz show

$\sup\{|V_N(r,s,t) - V_N(r,s,0)| : BN^{-1/2} \le s \le \delta,\ |r| \le M,\ |t| \le M\} = O_p(\delta) .$

From Lemma 4.1 we can choose $B$ large and $\delta$ small so that

$P\{\sup\{|V_N(r,s,0)| : BN^{-1/2} \le s \le \delta,\ |r| \le M\} > \varepsilon\} < \varepsilon/2 .$

Hence it suffices to prove the lemma when $\delta \le s \le 1$ for arbitrary $\delta$;
since $M$ is arbitrary we need only prove the lemma for the process
generated by

$V_N^*(r,s,t) = N^{-1/2} \sum_{i=1}^{[Ns]} c_i\, g_1(c_i)\, \psi\big(g_2(c_i)(z_i - c_i t N^{-1/2})(1 + r N^{-1/2})\big) .$

With this new process $V_N^*$ we need only prove tightness since (A2 and A7)
$\mathrm{Var}(V_N^*(r,s,t)) \to 0$. For any $t_*$, by A1-A3,

$\sup\{|V_N^*(r,s,t) - V_N^*(r,s,t_*)| : 0 \le s \le 1,\ |r| \le M,\ |t - t_*| \le \eta\} = O_p(\eta) .$

This means we need only prove tightness of $V_N^*(\cdot,\cdot,t_*)$, which follows from
A4 as in the proof of Lemma 4.1.  □

Lemma 4.3.  For every $M > 0$,

$\sup\{n^2 |\beta_n| N^{-3/2} : K_N \le n \le MN^{1/2}\} \to 0 .$
Proof.  Fix $\varepsilon, M > 0$. From B3,

$\sup\{P(|c\theta| = 0) : |\theta| = 1\} < 1 ,$

so that for every $D > 0$ there exists $\eta(D)$ with

(4.1)  $\sup\{P(0 < |c\theta| < \eta(D)) : |\theta| = 1\} < D/2 .$

Choose $Q > 1$ to satisfy the inequality of B5 and $P(|z| > Q) < \varepsilon$. Define

$R_n(\theta, L) = n^{-1} \sum_{i=1}^{n} g_1(c_i)(c_i\theta)\, \psi\big(g_2(c_i)(z_i - L c_i\theta)\sigma_n\big) .$

As in the proof of Theorem 2.2 of Yohai and Maronna (1979), $n^2 |\beta_n| N^{-3/2} > \varepsilon$
for some $K_N \le n \le MN^{1/2}$ implies $|\beta_n| > \varepsilon N^{1/2}/M^2$, so that $R_n(\theta, \varepsilon N^{1/2}/M^2) > 0$
for some $|\theta| = 1$. Since $\psi$ is monotone and skew symmetric, with
probability at least $1 - 2\varepsilon$ by B5,

$R_n(\theta, \varepsilon N^{1/2}/M^2)$
  $= n^{-1} \sum_{i=1}^{n} g_1(c_i) |c_i\theta|\, \psi\big(\sigma_n g_2(c_i)\{\mathrm{sign}(c_i\theta) z_i - \varepsilon N^{1/2} |c_i\theta| M^{-2}\}\big)$
  $\le n^{-1} \sum_{i=1}^{n} g_1(c_i) |c_i\theta|\, \psi\big(\sigma_n g_2(c_i)\{|z_i| - \varepsilon N^{1/2} |c_i\theta| M^{-2}\}\big)$
  $\le M_0 \sup|\psi|\; n^{-1} \sum_{i=1}^{n} I\{|z_i| > Q\}$
  $\quad + n^{-1} \sum_{i=1}^{n} g_1(c_i) |c_i\theta|\, \psi\big(Q^2 g_2(c_i)(1 - \varepsilon N^{1/2} |c_i\theta| / M^2 Q^3)\big)\, I\{|z_i| \le Q\}$
  $= R_{n1} + R_{n2} .$

By the strong law of large numbers (SLLN),

$\limsup_{n \to \infty} R_{n1} \le \varepsilon M_0 \sup|\psi|$  (a.s.).

Since $\psi$ is monotone, applying Lemma 1 of Yohai (1974) shows that for every
$L > 1$ (since $N^{1/2}/QM^2 \to \infty$)

(4.2)  $\limsup \{R_{n2} : |\theta| = 1,\ K_N \le n \le N\} \le \cdots$  (a.s.).

Since $\psi$ is monotone nondecreasing and nonconstant, $\liminf_{x \to -\infty} \psi(x) < 0$.
Therefore from (4.1), $g_1(c) > 0$ (A2), dominated convergence, and Fatou's
Lemma, one can choose $L$ and $Q$ sufficiently large so that the right side of
(4.2) is at most $A_* < 0$ and $A_*$ is independent of $\varepsilon$. One completes the
proof by choosing $\varepsilon$ sufficiently small.  □


Lemma 4.4.  For every $\varepsilon > 0$ there exist $M = M(\varepsilon) > 0$, $L(\varepsilon)$ such that for
$N$ sufficiently large

$P\{n |\beta_n| N^{-1/2} > L \text{ for some } MN^{1/2} \le n \le N\} < 4\varepsilon .$

Proof.  Choose $Q$ as in B5 and define

$R_N(\theta, L, n) = N^{-1/2} \sum_{i=1}^{n} g_1(c_i)(c_i\theta)\, \psi\big(g_2(c_i)(z_i - L c_i\theta)\sigma_n\big) .$

Let $L = M/Q$. As in Theorem 2.2 of Yohai and Maronna (1979),
$n |\beta_n| N^{-1/2} > L$ implies $R_N(\theta, LN^{1/2}/n, n) > 0$ for some $|\theta| = 1$. When
$Q^{-1} \le \sigma_n \le Q$ (with probability at least $1 - \varepsilon$ from B5) uniformly in
$|\theta| = 1$, then by B1,

$R_N(\theta, LN^{1/2}/n, n) \le A_1(n,N,\theta) - A_2(n,\theta)$
  $= N^{-1/2} \sum_{i=1}^{n} g_1(c_i)(c_i\theta)\, \psi(g_2(c_i) z_i \sigma_n)$
  $\quad - dMQ^{-2}\, n^{-1} \sum_{i=1}^{n} g_1(c_i) g_2(c_i)(c_i\theta)^2\, I\{|z_i| \le Q^{-1}\}\, I\{g_2(c_i) \le a,\ g_2(c_i)|c_i| \le b\} .$

By the SLLN and B3,

$\limsup_{n \to \infty} A_2(n,\theta) \ge dMQ^{-2} \lambda_{\min} P\{|z| \le Q^{-1}\}$  (a.s.).

Therefore, it suffices to show that

$\sup_{|\theta| = 1,\ MN^{1/2} \le n \le N} |A_1(n,N,\theta)| = O_p(1)$

for some $M$.

We have for $|\theta| = 1$ and $MN^{1/2} \le n \le N$

$|A_1(n,N,\theta)| \le \sup_{MN^{1/2} \le n \le N} \Big| N^{-1/2} \sum_{i=1}^{n} g_1(c_i)\, c_i\, \psi(g_2(c_i) z_i \sigma_n) \Big| .$

Assume $M > 1$. Then from B4 there exists $M_*$ independent of $M$ (we may
take $M_* < M$) so that

$P\{\sup\{[Ns]\, |\gamma_{[Ns]} - \gamma_0|\, N^{-1/2} : 0 \le s \le 1\} \ge M_*\} < \varepsilon .$

Hence

$P\{\sup\{n |\gamma_n - \gamma_0| N^{-1/2} : MN^{1/2} \le n \le N\} \ge M_*\} < \varepsilon$

or

$P\{|\gamma_n - \gamma_0| \le M_* N^{1/2} n^{-1} \text{ for all } MN^{1/2} \le n \le N\} \ge 1 - \varepsilon .$

Therefore for $M$ sufficiently large

$P\{\sup\{|\gamma_n - \gamma_0| : MN^{1/2} \le n \le N\} \le \gamma_0/2\} \ge 1 - \varepsilon ;$

hence for some $M_{**}$ not depending on $M$

$P\{|\sigma_n - \sigma_0| \le M_{**} \text{ for all } MN^{1/2} \le n \le N\} \ge 1 - \varepsilon .$

Thus, since $\sigma_0 = 1$,

$\sup_{|\theta| = 1,\ MN^{1/2} \le n \le N} |A_1(n,N,\theta)| \le \sup_{0 \le s \le 1,\ 0 \le |r| \le M_{**}} \Big| N^{-1/2} \sum_{i=1}^{[Ns]} g_1(c_i)\, c_i\, \psi(g_2(c_i) z_i (1+r)) \Big| = O_p(1)$

by Lemma 4.1.  □


Proof of Theorem 1.  Fix $\varepsilon > 0$ and $\delta > 0$. Then Lemma 4.4 shows
that when the sup in (2.2) is taken over $K_N/N \le s \le \delta$, (2.2) is $O_p(\delta)$.
Since $\delta$ is arbitrary, it suffices to prove (2.2) when the sup is taken
over $a \le s \le 1$ for any $a > 0$. In Lemma 4.2 set
$t = \sigma_0 [Ns](\beta_{[Ns]} - \beta_0) N^{-1/2}$, $r = [Ns](\sigma_{[Ns]}/\sigma_0 - 1) N^{-1/2}$, both of which are $O_p(1)$
(by Lemma 4.4 and B4). Then use A5; since $a \le s \le 1$, the proof is
complete.  □


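Lemma 4.5.  Assume A1-A7 and C1-C5. There is a function $h(x) \to 0$ as
$x \to 0$ for which the following obtains: for every $\varepsilon > 0$ sufficiently
small, almost surely as $n \to \infty$ and uniformly in $|\sigma - \sigma_0| \le \varepsilon$, $|\beta - \beta_0| \le \varepsilon$,

(4.3)  $\Big| n^{-1} \sum_{i=1}^{n} c_i' g_1(c_i) \big\{ \psi(g_2(c_i)(y_i - c_i\beta)\sigma) - \psi(g_2(c_i) z_i)$
       $\quad + \sigma c_i g_2(c_i) \psi'(g_2(c_i) z_i)(\beta - \beta_0)$
       $\quad - g_2(c_i) z_i \psi'(g_2(c_i) z_i)(\sigma - \sigma_0) \big\} \Big|$
       $\le h(\varepsilon)(|\sigma - \sigma_0| + |\beta - \beta_0|) .$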

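Proof.  Set $\sigma_0 = 1$, $\beta_0 = 0$ without loss. Define

$M_2 = 2 + \max\{|\kappa| : \kappa \in B\}$
$d(\varepsilon) = (2 M_2 \varepsilon^{1/2})^{-1}$
$A(\varepsilon) = \{|c|\, g_2(c) \le d(\varepsilon)\}$
$L(\varepsilon, \sigma, \beta) = \{|g_2(c)\sigma(z - c\beta) - \kappa| > 2\varepsilon d(\varepsilon) \text{ for all } \kappa \in B\} .$

Using C2 choose $\alpha > 1$ such that $\psi$ is constant on $(-\infty, -\alpha]$ and
constant on $[\alpha, \infty)$. Choose $\varepsilon$ so small that

$2\alpha/(1+\varepsilon) - \varepsilon d(\varepsilon)/(1-\varepsilon) \ge \alpha$

and

$2\alpha\varepsilon + \varepsilon d(\varepsilon)(1+\varepsilon) \le 2\varepsilon d(\varepsilon) .$

Now suppose that $|\sigma - 1| \le \varepsilon$, $|\beta| \le \varepsilon$, $(z,c) \in L(\varepsilon, \sigma, \beta)$ and
$c \in A(\varepsilon)$. Then if $g_2(c) z \ge 2\alpha$, it follows that

$g_2(c)\sigma z \ge 2\alpha/(1+\varepsilon) \ge \alpha$

and

$g_2(c)\sigma(z - c\beta) \ge 2\alpha/(1+\varepsilon) - |g_2(c) c\beta|/(1-\varepsilon) \ge \alpha ,$

so that $\psi(g_2(c)\sigma(z - c\beta)) = \psi(g_2(c) z)$; similarly $g_2(c) z \le -2\alpha$ implies
that $\psi(g_2(c)\sigma(z - c\beta)) = \psi(g_2(c) z)$. If $|g_2(c) z| < 2\alpha$, then

$|g_2(c)(z - \sigma(z - c\beta))| \le |g_2(c) z (\sigma - 1)| + |g_2(c)\sigma c\beta|$
$\phantom{|g_2(c)(z - \sigma(z - c\beta))|} \le 2\alpha\varepsilon + \varepsilon d(\varepsilon)(1+\varepsilon) \le 2\varepsilon d(\varepsilon) ,$

so that $g_2(c) z$ is in the same interval between points of $B$ as is
$g_2(c)\sigma(z - c\beta)$, so that we may use a Taylor expansion to bound the
summands in (4.3).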

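If $|\sigma - 1| > \varepsilon$, $|\beta| > \varepsilon$, $(z,c) \notin L(\varepsilon, \sigma, \beta)$, or $c \notin A(\varepsilon)$, then we can
use the fact that $\psi$ is Lipschitz and constant on $(-\infty, -\alpha]$ and on $[\alpha, \infty)$
to bound the summands. This suggests that we can bound (4.3) by
$C_0\{A_{n1} + A_{n2} + A_{n3} + A_{n4}\}$, where $C_0$ depends only on $\psi$ and

$A_{n1} = n^{-1} \sum \{g_2(c_i)(|\sigma - 1| + |c_i||\beta|)\}^2\, I\{|c_i| g_2(c_i) \le d(\varepsilon)\}$

$A_{n2} = n^{-1} \sum g_2(c_i)(|\sigma - 1| + |c_i||\beta|)\, I\{(z_i, c_i) \notin L(\varepsilon, \sigma, \beta)\}\, I\{|c_i| g_2(c_i) \le d(\varepsilon)\}$

$A_{n3} = n^{-1} \sum g_2(c_i)(|\sigma - 1| + |c_i||\beta|)\, I\{|c_i| g_2(c_i) \ge d(\varepsilon)\}$

$A_{n4} = n^{-1} \sum |\psi(g_2(c_i)\sigma(z_i - c_i\beta)) - \psi(g_2(c_i) z_i)|\, I\{|c_i| g_2(c_i) \ge d(\varepsilon)\} .$

We must prove the bound of the lemma for each term. This is easy to do
for $A_{n1}$ and $A_{n3}$ by applying the SLLN and using A3. For $A_{n4}$, since $\psi$ is
Lipschitz and A3 holds, we need only prove the lemma when $\beta = 0$. Since
$\psi$ is constant off a compact set, there is a constant $M_1$ with

$A_{n4}(\beta = 0) \le |\sigma - 1|\, n^{-1} \sum g_2(c_i) |z_i|\, I\{|c_i| g_2(c_i) \ge d(\varepsilon),\ g_2(c_i)|z_i| \le M_1\} ,$

so from A3 the lemma holds for $A_{n4}$. Finally, for some constant $M_2$,

$A_{n2} \le (|\sigma - 1| + |\beta|)\, n^{-1} \sum g_2(c_i) |c_i|\, I\{|g_2(c_i) z_i - \kappa| \le M_2 \varepsilon d(\varepsilon) \text{ for some } \kappa \in B\} ,$

so that A3, C3 and the SLLN and dominated convergence complete the
proof.  □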

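Proof of Theorem 2.  Fix $\varepsilon_1, \varepsilon_2 > 0$. Using the fact that $\Sigma_1$ is of full
rank, and applying A1, A2, B4, C4, C5 and Lemma 4.5, we see there exist
$M_1$, $M_2$ such that $N \ge M_1$ implies

(4.4)  $P\{|\beta_n - \beta_0| + |\sigma_n - \sigma_0| > \varepsilon_1 \text{ for some } K_N \le n \le N\} \le \varepsilon_1$

(4.5)  $P\Big\{N^{-1/2} \Big| \sum_{i=1}^{n} c_i' g_1(c_i) \psi(g_2(c_i) z_i) \Big| > M_2 \text{ for some } K_N \le n \le N\Big\} \le \varepsilon_2$

(4.6)  $P\Big\{\Big| n^{-1} \sum_{i=1}^{n} c_i' g_1(c_i) \big\{ \psi(g_2(c_i)\sigma(y_i - c_i\beta)) - \psi(g_2(c_i) z_i)$
       $\quad + \sigma c_i g_2(c_i) \psi'(g_2(c_i) z_i)(\beta - \beta_0)$
       $\quad - g_2(c_i) z_i \psi'(g_2(c_i) z_i)(\sigma - \sigma_0) \big\} \Big| > 4h(\varepsilon_1)(|\sigma - \sigma_0| + |\beta - \beta_0|)$
       $\quad \text{for some } K_N \le n \le N \text{ and some } |\sigma - \sigma_0| \le \varepsilon_1,\ |\beta - \beta_0| \le \varepsilon_1\Big\} \le \varepsilon_1 .$

Choosing $\beta = \beta_n$, $\sigma = \sigma_n$ we see that if $N \ge M_1$ the probability that the
following event obtains for all $K_N \le n \le N$ is at least $1 - 2(\varepsilon_1 + \varepsilon_2)$:

(4.7)  $n |\beta_n - \beta_0| N^{-1/2} \le 4h(\varepsilon_1)\, n |\beta_n - \beta_0| N^{-1/2} + 2M_2 .$

For (4.7) to occur we must have (for $\varepsilon_1$ small) $n |\beta_n - \beta_0| N^{-1/2} \le 4M_2$,
proving that

(4.8)  $\sup\{n |\beta_n - \beta_0| N^{-1/2} : K_N \le n \le N\} = O_p(1) .$

Placing $n(\beta_n - \beta_0) N^{-1/2}$ into (4.6) with $\sigma = \sigma_n$, $\beta = \beta_n$ completes the proof.  □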

Proof of Lemma 3.1.  This follows by applying the technique of Lemma 4.3,
B5, Lemma 1 of Yohai (1974), and (3.3).  □

Remark.  A1, A3, and B1-B2 imply (3.4) by dominated convergence.

Proof of Theorem 3.  We will first show that for $j = 1, 2$

(4.9)  $\sup\{|\Sigma_{jN}(s) - \Sigma_j| : K_N/N \le s \le 1\} \to 0 .$

First consider $j = 1$. From A2, C4 and C5, with probability at least
$1 - \varepsilon/2$,

(4.10)  $\Big| \Sigma_{1N}(s) - [Ns]^{-1} \sum_{i=1}^{[Ns]} c_i' c_i\, g_1(c_i) g_2(c_i) \psi'(g_2(c_i) z_i) \Big|$
        $\le [Ns]^{-1} \sum_{i=1}^{[Ns]} |c_i|\, g_1(c_i) \sup_{|r| \le \varepsilon,\, |t| \le \varepsilon} \big| \psi'(g_2(c_i)(1+r)(z_i - c_i t)) - \psi'(g_2(c_i) z_i) \big| .$

One applies (3.4) and the SLLN to prove (4.9); a similar proof using A7
and Lemma 1 of Yohai (1974) handles $j = 2$. Since $\Sigma_1$ and $\Sigma_2$ are of full rank,
the power becomes

$P\{[Ns]^2 N^{-3/2} |a' \beta_{[Ns]}| > [Ns] N^{-1} A_N(s) C_N(s)$
$\quad \text{for some } K_N/N \le s \le 1 \text{ or } N^{1/2} |a' \beta_N| > A_N(1) D\} ,$

from which (3.5) follows by Theorem 1.  □


5.  Extensions.

(a)  It is possible to extend the results to classical M-estimators.
In Theorem 2 we need $E|c_1|^2 < \infty$. For Theorem 1, A4 and A7 must be
modified so that Lemma 4.1 holds.

(b)  We have dealt with the case in which the design vectors $\{c_i\}$
are i.i.d., both because it is often reasonable and because it simplifies
the exposition. If these vectors are (i) constants or (ii) generated in
Section 3 by blocking on each group of two patients, it is still possible
to apply our techniques. Detailed and reasonably simple conditions can
be worked out, both because of the nature of BIR estimates and because
our results depend in large part on easily generalized standard SLLN and
weak convergence techniques. The major difficulties lie in (4.1) and
(4.2) and the terms in Lemma 4.5.

(c)  It is possible but extremely messy to extend the results to
estimating $S_0$ by $S_n$.

(d)  In Lemma 4.5, the major use of symmetry is that it assures
$E\, c' g_1(c) g_2(c)\, z\, \psi'(g_2(c) z) = 0$. For Section 3, the symmetry is unnecessary
if $g_j(c) = g_j(|c|)$ ($j = 1, 2$) and the patients are randomly (with
probability 1/2) assigned to each group, for then

$E\, a' c'\, g_1(c) g_2(c)\, z\, \psi'(g_2(c) z) = 0 .$


(e)  The approximation (2.3), while only of first order, can in fixed
sample problems give some information about the dependence of the design
on the rate of convergence through Berry-Esseen results for

$N^{-1/2} \sum_{i=1}^{N} c_i' g_1(c_i) \psi(g_2(c_i) z_i) .$

Since the $\{c_i\}$ are i.i.d., when $\gamma_0$ is estimated in an appropriately
smooth fashion simultaneously with $\beta$ (as in Huber's Proposal 2), the BIR
estimates are minimum contrast estimates of a vector parameter. If $\psi$ has
a derivative then Berry-Esseen results and Edgeworth expansions are
available (e.g., Pfanzagl (1973) and Bhattacharya and Ghosh (1978)). For
the Huber function $\psi(x) = \max(-b, \min(x, b))$ when the design is bounded,
it should be possible to prove a version of (3.8) in Bickel (1974) and
obtain an Edgeworth expansion. (In the location case with scale not
estimated, Jurečková (unpublished) has verified Bickel's (3.8).)




REFERENCES 


Anderson, T.W. (1960). A modification of the sequential probability
ratio test to reduce the sample size. Ann. Math. Statist. 31,
165-197.

Andrews, D.F., Bickel, P.J., Hampel, F.R., Huber, P.J., Rogers, W.H., and
Tukey, J.W. (1972). Robust Estimates of Location: Survey and
Advances. Princeton: Princeton University Press.

Bhattacharya, R.N. and Ghosh, J.K. (1978). On the validity of the formal
Edgeworth expansion. Ann. Statist. 6, 434-451.

Bickel, P.J. (1974). Edgeworth expansions in nonparametric statistics.
Ann. Statist. 2, 1-20.

Bickel, P.J. (1975). One-step Huber estimates in the linear model.
J. Amer. Statist. Assoc. 70, 428-434.

Bickel, P.J. and Wichura, M.J. (1971). Convergence criteria for
multiparameter stochastic processes and some applications. Ann. Math.
Statist. 42, 1656-1670.

Billingsley, P. (1968). Convergence of Probability Measures. New York:
John Wiley and Sons, Inc.

Carroll, R.J. (1978). On almost sure expansions for M-estimates.
Ann. Statist. 6, 314-318.

Huber, P.J. (1977). Robust Statistical Procedures. SIAM, Philadelphia.

Maronna, R.A. and Yohai, V.J. (1978). Robust M-estimators for regression
with contaminated independent variables. Unpublished manuscript.

Pfanzagl, J. (1973). The accuracy of the normal approximation for
estimates of vector parameters. Z. Wahrsch. verw. Gebiete 25,
171-198.

Schrader, R.M. and Hettmansperger, T.P. (1980). Robust analysis of
variance based upon a likelihood ratio criterion. Biometrika 67,
93-101.

Siegmund, D. (1977). Repeated significance tests for a normal mean.
Biometrika 64, 177-189.

Siegmund, D. (1979). Sequential $\chi^2$ and F tests and the related
confidence intervals. Stanford University Technical Report No. 134.

Yohai, V.J. (1974). Robust estimation in the linear model.
Ann. Statist. 2, 562-567.

Yohai, V.J. and Maronna, R.A. (1979). Asymptotic behavior of M-estimators
for the linear model. Ann. Statist. 7, 258-268.


